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B&P File No. 936M40/UG 
Title; METHOD FOR CLEAVAGE OF FUSION PROIHNS 
PTCT n r>P TWF INVENTION 

The present invention relates to an improved method for 
5 recovering recombinantly produced polypeptides- The method 
involves expressing the recombinant polypeptide as a fusion protein 
vnth a pro-peptide. The pro-peptide-polypeptide fusion protein can be 
cleaved and the recombinant polypeptide released under the 
appropriate conditions. 
10 BACKGRQUNn OF THE INVENTION 

Several enzymes, such as pepsin and chymosin, are 
synthesized in vivo as inactive precursors ^or z5anogens. Zymogens 
become activated xmder appropriate conditions, which in the case of 
pepsin and chymosin are provided by the acidic environment of the 
15 mammal stomach. In the process, two interrelated events occur: (1) 
the zymogen is activated by a conformational change in the protein 
(activation step) and sequentially or simultaneously (2), an amino- 
terminal peptide, referred to in the art as the pro-peptide, is cleaved 
from the zymogen to form the mature protein (processing step). The 
20 activation and processing of zjrmogens has been extensively 
characterized (McCaman and Cummings, 1986, J. Biol. Chem. 261: 
15345-15348) although so far the process has only been studied using 
the pro-peptide linked to its homologous mature protein. 

An important factor for the commercial development of 
25 biotechnology is the purification of recombinant DNA products from 
expression systems, which typically accounts for 50% or more of the 
costs (Labrou, N. and Qonis, Y.D., (1994), J. Biotechn-, 36 95-119). Many 
recombinant DNA products are produced as fusion proteins. 
Consequently, one step in the purification process involves an 
30 enzymatic or chemical cleavage step in which the produced protein is 
processed to its mature form. Very few proteolytic enzymes cleave 
fusion proteins specifically and efficiently. In addition, the cxirrently 



widely xised proteolytic enzymes, such as blood dotting factor )Ca and 
thrombin, are expensive, and contamination of final product with 
blood pathogens is a consideration. 

rnnRPfluentlv, there is a need for an improved process for 

^ — V - 

5 recovering recombinantly produced polypeptides 'from their 
expression systems. 
STTMlVff ARY OF THK TNTVENTION 

The present inventors have developed a novel method for 
recovering recombinantly produced polypeptides. The method 

10 involves expressing the polypeptide as a fusion protein with a pro- 
peptide so that the recombinant polypeptide can be cleaved from the 
pro-peptide xmder the appropriate conditions. 

In one aspect, the invention provides a chimeric DNA 
sequence encoding a fusion protein, the chimeric DNA sequence 

15 comprising a first DNA sequence encoding a pro-peptide and a second 
DNA sequence encoding a polypeptide that is heterologous to the pro- 
peptide. 

In another aspect, the present invention, provides a method 
for the preparation of a recombinant polypeptide comprising 
20 (a) introducing into a host cell an expression vector comprising: 

(1) a first sequence capable of regulating transcription in a 
host cell, operatively linked to 

(2) a second chimeric DNA sequence encoding a fusion 
protein, said chimeric DNA sequence comprising a first 

25 DNA sequence encoding a pro-peptide, linked in reading 

frame to a second DNA sequence heterologotis to said 
pro-peptide and encoding said recombinant polypeptide; 
operatively linked to 

(3) a third DNA sequence encoding a termination region 
30 functional in the host cell, 

(b) growing said host cell to produce said fusion protein; and 



(c) altering the environment of said fusion protein so that said 

pro-peptide is cleaved from said fusion protein to release said 
recombinant polypeptide. 

Other objects, features and advantages of the present 
5 invention will become apparent from the following detailed 
description. It should be imderstood, however, tfiat the detailed 
description and the specific examples while indicating preferred 
embodiments of the invention are given by way of illustration only, 
since various changes and modifications within ttie spirit and scope of 
10 the invention will become apparent to those skilled in the art of this 
detailed description. 

BRIEF PESCRIPTTONf OF THF DRAWINGS 

The invention will now be described in relation to the 
drawings in which: 
15 Figure 1 is the nucleic add and deduced amino acid sequence 

of a GST-Chymosin pro-peptide-Hirudin sequence. 
OFT An.FP D Ff^rRIPTIQN OF THF INVENTION 

As hereinbefore mentioned, the present invention relates to 
a novel method for preparing and recovering recombinant 

20 polypeptides. 

Accordingly, the present invention provides a method for the 
preparation of a recombinant polypeptide comprising: 
(a) introducing into a host cell an expression vector comprising: 

(1) a first sequence capable of regulating transcription in a 
25 host cell, operatively linked to 

(2) a second chimeric DNA sequence encoding a fusion 
protein, said chimeric DNA sequence comprising a first 
DNA sequence encoding a pro-peptide, linked in reading 
frame to a second DNA sequence heterologous to said 

30 pro-peptide and encoding said recombinant polypeptide, 

operatively linked to 



(3) a third DNA sequence encoding a termination region 
functional in ssdd host cell, 

b) growing said host cell to produce said fusion protein; and 

c) altering the enviromnMit of said fusion protein so that said 
5 pro-peptide is cleaved from said fusion protein to release said 

recombinant polypeptide. 

The term "pro-peptide" as used herein means the amino 
terminal portion of a zymogen up to the maturation site. 

The polypeptide can be any polypeptide that is heterologous 
10 to the pro-peptide meaning that it can not be the mature protein that is 
normally associated with the pro-peptide as a zymogen. 

The invention also includes a chimeric DNA sequence 
encoding a fusion protein, the chimeric DNA sequence comprising a 
first DNA sequence encoding a pro-peptide and a second DNA 
15 sequence encoding a polypeptide that is heterologous to the pro- 
peptide. 

The chimeric DNA sequences which encode the fusion 
proteins of the present invention can be incorporated in a known 
manner into a recombinant expression vector which ensures good 
20 expression in a host cell. 

Accordingly, the present invention also includes a 
recombinant expression vector comprising a chimeric DNA molectile 
of the present invention operatively linked to a regulatory sequence 
and termination region suitable for expression in a host cell. 

25 The term "suitable for expression in a host cell" means that 

the recombinant expression vectors contain the chimeric DNA 
sequence of the invention a regulatory sequence and a termination 
region, selected on the basis of the host cells to be used for expression, 
which is operatively linked to the chimeric DNA sequence, 

30 Operatively linked is intended to mean that the chimeric DNA 
sequence is linked to a regulatory sequence and a termination region 
in a manner which allows expression of the chimeric sequence. 
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Regulatory sequences and termination regions are art-recognized and 
are selected to direct expression of the desired protein in an 
appropriate host cell. Accordingly, the term regulatory sequence 
includes promoters, enhancers and other expression control elements. 
5 Such regulatory sequences are known to those skilled in the'art or one 
described in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990) can be used. It 
should be understood that the design of the expression vector may 
depend on such factors as the choice of the host cell to be transformed 
10 and/or the type of protein desired to be expressed. Such expression 
vectors can be used to transform cells to thereby produce fusion 
proteins or peptides encoded by nucleic adds as described herein. 

The recombinant expression vectors of the invention can be 
designed for expression of encoded fusion proteins in prokaryoHc or 
15 eukaryotic cells. For example, fusion proteins can be expressed in 
bacterial cells such as E. coli, insect cells (using baculovirus), yeast cells, 
plant cells or mammalian cells. Other suitable host cells can be foxmd 
in Goeddel, Gene Expression Technology: Methods in En2ymology 185, 
Academic Press, San Diego, CA (1990). 
20 Expression in prokaryotes is most often carried out in E. coli 

with vectors contaiiung constitutive or inducible promotors directing 
the expression of the fusion proteins. Inducible expression vectors 
include pTrc (Amann et al., (1988) Gene 69:301-315) and pET lid 
(Studier et al.. Gene Expression Technology: Methods in Enzymology 
25 185, Academic Press, San Diego, California (1990) 60-89). While target 
gene expression relies on host RNA poljrmerase transcription from the 
hybrid trp-lac fusion promoter in pTrc, expression of target genes 
inserted into pET lid reUes on transcription from the T7 gnlO-lac 0 
fusion promoter mediated by coexpressed viral RNA polymerase (T7 
30 gnl). This viral polymerase is supplied by host strains BL21(DE3) or 
HMS174PE3) from a resident X prophage harboring a T7 gnl under 



the transcriptionaUcpntrol of the lacUV 5 promoter. Another 
attractive bacterial expression system is the pGEX expression system 
(Pharmacia) in which genes are expressed as fusion products of 
glutathione-S-transferase (GST), aUowing easy purification of the 
5 expressed gene from a GST affinity column. 

One strategy to mjodmize recombinant protein expression in 
E. coU is to express the protein in host bacteria with an impaired 
capacity to proteolytically deave the recombinantly expressed proteins 
(Gottesman, S., Gene Expression Technology: Methods in Enzymology 
10 185. Academic Press, San Diego, California (1990) 119-128). Another 
strategy is to alter the nucleic add sequence of the chimeric DNA to be 
inserted into an expression vector so that the individual codons for 
each amino acid would be those preferentially utilized in highly 
expressed E. coli proteins (Wada et al., (1992) Nuc. Acids Res. 20: 2111- 
15 2118). Such alteration of nucleic add sequences of the invention could 
be carried out by standard DNA synthesis techniques. 

Examples of vectors for expression in yeast S. cerivisae 
indude pYepSecl (Baldari. et al., (1987) Embo J. 6:229-234), pMFa 
(Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et al., 
20 (1987) Gene 54:113-123), and pYES2 (Invitrogen Corporation, San 
Diego, CA). 

Baculovirus vectors available for expression of proteins in 
cultured insect ceUs (SF 9 cells) include the pAc series (Smith et aL, 
(1983) Mol. CeU Biol. 3:2156-2165) and the pVL series (Lucklow, VJ\.., 

25 and Summers, M.D., (1989) Virology). 

The polypeptide of the present invention may be any 
polypeptide that is not normally fused to the pro-peptide used in the 
method^ The polypeptide is preferenttaUy stable under cleavage 
conditions, for example at acidic pH, and the polypeptide may be 

30 activated after cleavage upon adjusting the pH, or altering the 
environment otherwise so that conditions optimal for enzymatic 
activity are generated. 
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The pro-peptide used in the present invention may be any 
pro-peptide including pro-peptides derived from aspartic proteases, 
serine proteases, lipases, and esterases. In preferred embodiments of 
the invention, the pro-peptide is derived from chymosiiv trypsinogen, 
5 pepsin or insulin. The invention also includes the full* length pro- 
peptide as well as portions of the pro-peptide or mutated forms of the 
pro-peptide. Mutated forms of the pro-peptide may be used to obtain 
specific cleavage between the pro-peptide and a heterologous protein. 
Mutations in the pro-peptide could alter the optimal conditions, such 
10 as temperature, pH and salt concentration, imder which cleavage of a 
heterologous peptide is achieved (McCaman, M.T. and Cimunings, 
D.B., (1986), J. Biol. Chem. 261:15345-15348). Depending on the pro- 
peptide, cleavage of the heterologous protein from various pro- 
peptides, will be optimal under varying different conditions. Thus the 
15 invention will be amenable to heterologous proteins which are 
preferentially cleaved imder a variety of desirable conditions. 

The DNA sequence encoding the heterologous polypeptide is 
preferably fused downstream of the DNA sequence encoding the pro- 
peptide and concatamers contaiiung repetitive imits of pro-peptide- 
20 heterologous protein may be employed. 

In one embodiment, the pro-peptide is a pro-peptide derived 
from chymosin and the heterologous polypeptide is hirudin (Dodt et 
al., 1984, FEBS Letters 65:180-183). In particular, the present inventors 
have constructed a chimeric DNA sequence in which the DNA 
25 encoding the chymosin pro-peptide was fused upstream of the DNA 
sequence encoding the leech anticoagulant protein hirudin. The gene 
fusion (Pro-Hirudin) was expressed in E. coli cells. It was foimd diat 
upon lowering of the pH to pH 2, and more preferably to pH 45, and in 
the presence of a small quantity of mature chymosin, the 
30 heterologously fused protein, hirudin, was efficientiy cleaved from ttie 
chymosin pro-peptide. 



Although the inventors have employed matxire chymosin to 
assist in the cleavage reaction, it is conceivable that the use of other 
pro-peptides may not require the addition of the mature peptide in 
order to accomplish efficient deavage, 
5 Activation of the fusion protein may be in vitro or in vivo. 

In one embodiment, the pro-peptide is used to fadUtate deavage from 
proteins recombinantly produced on oil bodies as disdosed in PCT 
appUcatlon PubUcation No. WO 96/21029. In this embodiment, the 
pro-peptide would be fused downstream of an oil body protein and 
10 upstream of the recombinant protein or peptide of interest 

In another in vivo application, two vectors would be 
introduced in the same host. In one vector expression of the zymogen 
or the mature protein wotdd be controlled by an inducible promoter 
system. The other vector would comprise a pro-peptide fused 
15 upstream of an heterologoxxs protein of interest. Thus it is possible to 
control the moment of deavage of the peptide or protein downstream 
of the pro-peptide through the promoter which controls expression of 
the zymogen or the mature protein. Alternatively, the two expressed 
genes would be combined in the same vector. In preferred 
20 embodiments of this application, the pro-peptide employed is deaved 
imder physiological conditions. 

A peptide or protein delivery application is also envisaged. 
In this embodiment, a therapeutic or nutritional peptide or protein 
would be employed as an inactive fusion protein. Activation or 
25 maturation through cleavage would only occur upon its delivery at 
the imique physiological conditions prevalent at the target organ or 
tissue. Cleavage might be erJianced by a protease specific for the 
peptide. This method is particularly useful for the deUvery of orally 
ingested vaccines, cytokines, gastric lipase, peptide antibiotics, lactase 
30 and cattle feed enzymes which facilitate digestion, such as xylanase and 
cellulase. For example, a therapeutic or nutritional peptide or protein 
fused downstream of the chymosin pro-peptide might be activated in 



the mammal stomach upon ingestion of chymosin or the inactive 
precursor form of chymosin. 

The present invention is also useful in the purification 
process of recombinant proteins. In one embodiment, a cell extract 

5 containing an expressed pro-peptide-heterologous fusioTn protein is 
appUed to a chromatographic column. Selective binding of the fusion 
protein to antibodies raised against the pro-peptide sequence and 
immobiUzed onto the column, results in selective retention of the 
fusion protein. Instead of relying on antibodies against the pro-peptide 

10 sequence, a gene encoding another immimogenic domain or a gene 
encoding a peptide with affinity for a commonly used column 
material, such as ceUulose or chitin, or any other desirable tag, may be 
included in the gene fusion. 

The following non-limiting examples are iUustrative of the 

15 present invention. 
EXAMPLES 
EXAMPLE 1 

In the first example, the protein hirudin was prepared as a fusion 
protein with the chymosin pro-peptide. 

Construction of a pGEX-Pro-Hirudin fusion. 

The fusion protein that we studied comprises the pro-peptide 
of calf chymosin B (Folbnann et al, 1977; Harris et al., 1982, Nucl. 
Acids. Res., 10: 2177-2187) fused to hirudin variant 1 (Dodt et aL, 1984, 

25 FEES Letters 65: 180-183). The hybrid gene which encoded this fusion 
protein was constructed using standard PCR methods (Horton et aL, 
1989, Gene, 77r 61-68). The DNA sequence for this Pro-Hirudin fusion 
was cloned into pGEX-4T-3 (Pharmacia), downstream of the gene 
encoding glutathion-S-transferase (GST). The complete sequence of 

30 the GST-Pro-Hirudin sequence is shown in Figure 1. 
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Growth of E, coli transf nned with pGEX-4T-3 and pGEX-Pro-Himdiiu 
Plasmids pGEX-4T-3 and pGEX-Pro-Hirudin were 

transformed into E. coll strain DH5a to allow for high level of 

• . . ;i _• 1- »rao ftsoA w\ iTt/v nla tip .'iTnl LB-amD broth. 

expression. /\ »uigic wiwijr ^ 

5 These cultures were grown overnight One ml of ead\ b/n culture was 
used to inoculate 50 ml of LB-amp broth. These cultures were grown 
until OD6oo= 0-6- At this OD, IPTG (final concentration ImM) was 
added to induce the expression of the GST and GST-Pro-Hirudin 
fusion proteins. After this induction, the cultures were grown for ah 
10 additional 3 hours at 3TC. The ceUs were pelleted at 5000 x g for 10 
minutes, and resuspended in 5 ml Tris Buffered Saline (TBS). The 
resuspended ceUs were sonicated and centrifuged at 12000 x g for 15 
minutes to separate the inclusion bodies (pellet fraction) from the 
soluble proteins (supernatant fraction). Western blotting of both the 
15 pellet and supemation fraction indicated that under the growing 
conditions described above, significant amounts (5-10%) of the GST 
and GST-Pro-Hirudin protein were foimd in the supernatant fraction. 
The rest (90-95%) accumulated ia inclusion bodies (results not shown). 
Hirudin activity measurements 
20 The supernatant fractions of both the GST and GST-Pro- 

Hirudin were tested for anti-thrombin activity. The samples were 
treated as follows: A) 20 \ll supernatant + 20 \l\ water B) 20 p.1 
supernatant + 20 ^1 of 100 mM Sodium Phosphate pH 2.00 20 ]i\ 
supernatant + 20 jil of 100 mM Sodium Phosphate pH 2.0 + 2 jig 
25 chymosin (Sigma) D) 20 jil supernatant + 20 of 100 mM Sodium 
Phosphate pH 4.5 E) 20 jil supernatant + 20 of 100 mM Sodium 
Phosphate pH 4.5 + 2 fig chymosin. These samples were incubated at 
room temperature for 1 hour. A total of 10 jU of the samples was 
added to 1 ml assay buffer (20 mM Tris [pH 7.51, 100 mM NaCl, 5 mM 
30 CaClz, 0.1 imit of thrombin) and incubated for 2-3 minutes before the 
addition of SOjil p-tos-gly-pro-arg-nitroanilide (1 mM). Thrombin 
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activity was measured as a function of chromozyme cleavage by 
monitoring the increase in absorption at 405 nm over time (Chang, 
1983, FEBS Letters, 164: 307-313). The AAbs (405nm) was determined 
after 2 minutes. The result of the activity measurements are indicated 
in Table 1. 

As can be seen from Table 1, the only extract which exhibited 
significant anti-thrombin activity was the extract containing the GST- 
Pro-Hirudin fusion which was treated at pH 4.5 and supplemented 
with 2 ng chymosin (E). Western blotting (results not shown) 
indicated that apart from treatment at pH 4.5, complete cleavage was 
also observed when the GST-Pro-Hirudin hision which was treated at 
pH 2.0 and supplemented with 2 ng chymosin. It has been well 
documented that unprocessed chymosin when exposed at pH 2.0, 
forms a pseudochymosin, before it matures into chymosin (Foltmann 
15 et al., 1977, Scand. J. Clin. Lab. Invest. 42: 65-79; Foltmann, 1992, Proc. 
Natl. Acad. Sci. 74: 2321-2324; McCaman and Cummings, 1988, J. Biol. 
Chem. 261: 15345-15348) The pseudo chymosin cleavage site is located 
between the Phe^^-Leu^s peptide bond and is indicated in Figure 1. 
The inabiUty of the GST-Pro-Hirudin fusion, which was treated at pH 
20 2.0 and supplemented with 2 \ig chymosin, to inhibit thrombin 
activity can be explained by the fact that cleavage occurred at the Phe^^- 
Leu28 peptide bond rather than at the Phe^-Val** peptide bond which 
separates the chymosin pro-peptide from the mature hirudin. It has 
been weU documented that (Loison et al., 1988, Bio/Technology, 6: 72- 
25 77) mature hirudin is orUy active when it does not have any additional 
amino acids attached to its native N-terminal sequence. 

While the present invention has been described with 
reference to what are presently considered to be the preferred 
examples, it is to be understood that the invention is intended to cover 
30 various modifications and equivalent arrangements included within 
the spirit and scope of ti\e appended claims. 
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All publications, patents and patent applications are herein 
incorporated by reference in their entirety to the same extent as if each 
individual publication, patent or patent application was specifically 
and individuaUy indicated to be incorporated by reference in its 
entirety. 
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DETAILED HGURE LEGENDS 

Figure 1. The nucleic acid and deduced amino add sequence of a GST- 
Pro-Hirudin sequence. The deduced sequence of the chymosin pro- 
peptide has been underlined and tfie deduced hirudin protein 
sequence has been italicized. The hirudin nucleic add sequence was 
optimized for plant codon usage. The pseudochymosin deavage site 
between Phe27-Leu28 and the peptide bond separating the pro- 
chymosin and mature hirudin (Phe 42- Val43) are indicated witii an 

arrow (T). 
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Tablel: Activity measurements f bacterial extracts c ntaining GST 
(Glutathion-S-transferase) and GST-Pro-Hirudin fusi ns.. 







A Abs (4D5iun)/2min 


A Abs (40%un)/2ixun 






[Tcstll 






1 unit Thrombin 


0.088 


0.066 




A: GST 


0.087 


0.082 ll 






0.082 


0.073 




C: Cab 1 prl Z.U +z |ig cnymosm 


0.063 


0.073 






0.087 


0.086 


a 


E: GST pH 43 +2 p.g diymosin 


0.087 


0.087 


ft = 

... Sir 

^^ J.,- in 


A: GST-PROHIR 


0.076 


0.071 


B: GST-PROHIR pH 2.0 


0,072 


0.064 




C: GST-PRO-HIR pH 2.0 +2 jig 


0.066 


0.070 




chymosin 








D: GST-PROHIR pH 45 


0.078 


0.075 




E: GST-PROHIR pH ^S +2jtg chymosin 


0.0002 


0.0001 




Hirudin 2^g 


0.0001 


0.0001 
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We qaim: 

1. A method for the preparation of a recombinant polypeptide 
comprising 

a) introducing into a host cell an expression vector comprising: 

(1) a first sequence capable of regxilating transcription in a 
host cell, operatively linked to 

(2) a second chimeric DNA sequence encoding a fusion 
protein, said chimeric DNA sequence comprising a first 
DNA sequence encoding a pro-peptide, linked in reading 
frame to a second DNA sequence heterologous to said 
pro-peptide and encoding said recombinant polypeptide; 
operatively linked to 

(3) a third DNA sequence encoding a termination region 
functional in said host cell, 

b) growing said host cell to produce said fusion protein; and 

c) altering the environment of said fusion protein so that said 
pro-peptide is cleaved from said fusion protein to release said 
recombinant polypeptide. 

2. A chimeric DNA sequence according to claim 1 wherein the 
pro-peptide is derived from aspartic proteases, serine proteases, 
lipases, or esterases. 

3. A chimeric DNA sequence according to claim 1 wherein the 
pro-peptide is a pro-peptide derived from chymosin, insulin, 
tripsinogen or pepsin. 



4. A chimeric DNA sequence according to claim 1 wherein said 

polypeptide is hirudin. 



5. An expression vector comprising a chimeric DNA sequence 
according to claim 1 and a regulatory sequence suitable for expression 
in a host cell. 

6. A transformed host cell containing an expression vector 
according to claim 6. 

7. A method according to claim 1 wherein said altering the 
environment comprises altering the pH to pH 2 or to pH 4.5. 

8. The method according to claim 1 wherein a mature protein of 
the pro-peptide is added in step (c). 
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B&P FUe No. 9369-040/MG 

ABSTRACT OF THP niSCLOSURE 

An improved method for recovering recombinandy produced 
polypeptides is described. The method involves expressing the 
recombinant polypeptide as a fusion protein with a pro-peptide. The 
pro-peptide-polypeptide fusion protein can be cleaved and the 
10 recombinant polypeptide released imder the appropriate conditions. 



FIGURE 1 

Nucleotide sequence and the deduced amino acid sequence of an GST-PrD=^ 
Hirudin fusi n. 

lATCTOCCCTiO'ACTAGGrrTOTTOTAAAiOTAMCXXrCCT 60 

... ^^^-^vuttrTV-RT. VOPTRLIi 20 

eiTOXSGAATATCTTGAAGAAAAATATGAAGitfSCATTTGTA^ 

21LEYLEEKYEEHLYERDEGDK 40 

121TX»CC5AAACAAAAAGTTTGAATTGGC3TTIGGMT^ ^.80 

41WRNKKFEI,GLEFPNLPYYID 60 

181 QGT GAT <OT AAA TTA ACA CAG TCT ATG GCC ATC ATA CCn^ 240 

61GDVKLTQSMAIIRYIADKHN 80 

241 ATC TTO GCT GOT TC3T CCA AAA GAG CGT GCA GAG ATT TCA ATG CTT GAA GGA OCG CTTT TTG 300 

81MLGGCPKERAEISMLEGAVL 100 

301 GAT ATT AGA TAC QGT GTIT TCG AGA ATT GCA TAT AGT AAA GAC TTT GAA ACT CTC AAA GTT 360 

lOlDIRYGVSRIAYSKDFETLKV 120 

361 GAT TTP CTT AQC AAG CTA OCT GAA ATG CTG AAA ATG TTC GAA GAT CGT TTA TOT CAT AAA 420 

121DFLSKLPEMLKMFEDRLCHK 140 

421 ACA TAT TTA AAT GGT GAT CAT GTA ACC CAT CCT GAC TTC ATG TTG TAT GAC GCT CTT GAT 480 

141TYLNGDHVTHPDFMLYDALD 160 

481 CTT CTT TEA TAC ATC GAC CCA ATG TGC CTG GAT GOG TTC CCA AAA TTA GTT TGT TTT AAA 540 

161VVLYMDPMCLDAFPKLVCFK 180 

541 AAA GOT ATT GAA GCT ATC CCA CAA ATT GAT AAG TAC TTG AAA TCC AGC AAG TAT ATA GCA 600 

181KRIEAIPQIDKYLKSSKYIA 200 

601 TGG CCT TTG CAG GGC TC3G CAA GCC ACG TTT GGT GGT GGC GAC CAT CCT OCA AAA TCG GAT 660 

201WPLQGWQATFGGGDHPPKSD 220 

661 CTC CTT COG CGT GGA TCC CCG AAT TCC COG GTC GAC TCG AGC GGC CGC GCT GAG ATC ACC 720 
221LVPRGS PNSRVDSSGR h £ 1 31 240 



721 AGG ATC CCT CTC TAC AAA GGC AAG TCT CTC AGG AAG GCG CTC AAG GAG CAT GOG CTT CTC 



780 
260 



FTHURE 1 (Cont^d) 



781 GSC GAC TIC CIG CMS AAACASCW3TATGQCATCAQCMCAKSTACTOCOQCTK: OIC OIC 840 

« n P T. n K o o Y S I S S K X S_fi E V V 280 

T t 

841 TAT ACC GAC TGT AOC GAG TCC GOT CAG AAC CTC TOT CTC TOT GAG OGT TCC AAC OIC TOT 900 

2B1 Y T D C T B S G Q N h C L C E G :S ' V C 300 

901 GOT CAG GOT AAC AAG TOT ATC CTC OCT TCC GAC GOT GAG AAG AAC C«5 TOT GTC AOC GOT 960 

301G(?GWJCCXX,GSDGKJCi/OCVTG 320 



961 GAG GGA ACC OCA AAG CCA CAG TCC CAC AAC GAC GOT GAC TTT GAG GAG ATC CCA GAG GAG 



321 E 



1020 



SHNDGD rBBIPBB 340 



1021 TAT CTC CAG TAA agatctaagcttgctgctgctatcgaattcctgcagcccgggggatocactagttct:agagcggl096 
341 y 1. <? * 



